智能论文笔记

GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency

Lin Tian , Hastings Greer , François-Xavier Vialard , Roland Kwitt , Raúl San José Estépar , Marc Niethammer

分类：计算机视觉

2022-06-13

许多注册方法都存在着早期工作，重点是基于优化的图像对方法。最近的工作着重于深度注册网络，以预测空间转换。在这两种情况下，通常使用的非参数登记模型，该模型估计转换功能而不是低维转换参数，都需要选择合适的正常器（鼓励平滑转换）及其参数。这使得模型难以调整，并将变形限制为所选正规器允许的变形空间。尽管存在不正常转换的光流的深度学习模型，而是完全依赖于数据，这些模型可能不会产生对医学图像注册期望的差异转换。因此，在这项工作中，我们在无监督的图标深度学习登记方法上开发了Gradicon，该方法仅使用逆矛盾进行正则化。但是，与图标相反，我们证明并从经验上验证，使用梯度反矛盾损失不仅显着改善了收敛性，而且还会导致所得转换图的类似隐式正则化。磁共振（MR）膝关节图像和计算机断层扫描（CT）肺图像的合成实验和实验表明Gradicon的表现出色。我们在保留简单的注册公式的同时，实现了最新的（SOTA）精度，这实际上很重要。

translated by 谷歌翻译

On Measuring Excess Capacity in Neural Networks

Florian Graf , Sebastian Zeng , Bastian Rieck , Marc Niethammer , Roland Kwitt

分类：机器学习 | (统计)机器学习

2022-02-16

我们在监督分类的背景下研究深网的过剩能力。也就是说，给定对基本假设类别的能力度量（在我们的情况下，是经验性的Rademacher的复杂性），我们（先验）可以限制该类别的数量，同时在与无约束性方面保持经验误差的同时保留经验误差？为了评估现代体系结构（例如残留网络）的过剩能力，我们扩展并统一了先前的Rademacher复杂性界限，以适应功能组成和添加以及卷积的结构。我们边界中的容量驱动项是层的Lipschitz常数和卷积权重初始化的（2,1）组的范围距离。在不同任务难度的基准数据集上进行的实验表明，（1）每个任务的容量大量超过容量，并且（2）可以将容量保持在整个任务的惊人相似水平。总体而言，这表明了重量规范的可压缩性概念，这是通过重量修剪正交的经典压缩概念。

translated by 谷歌翻译

Efficient aggregation of face embeddings for decentralized face recognition deployments (extended version)

Philipp Hofer , Michael Roland , Philipp Schwarz , Renè Mayrhofer

分类：人工智能 | 计算机视觉

2022-12-20

Biometrics are one of the most privacy-sensitive data. Ubiquitous authentication systems with a focus on privacy favor decentralized approaches as they reduce potential attack vectors, both on a technical and organizational level. The gold standard is to let the user be in control of where their own data is stored, which consequently leads to a high variety of devices used. Moreover, in comparison with a centralized system, designs with higher end-user freedom often incur additional network overhead. Therefore, when using face recognition for biometric authentication, an efficient way to compare faces is important in practical deployments, because it reduces both network and hardware requirements that are essential to encourage device diversity. This paper proposes an efficient way to aggregate embeddings used for face recognition based on an extensive analysis on different datasets and the use of different aggregation strategies. As part of this analysis, a new dataset has been collected, which is available for research purposes. Our proposed method supports the construction of massively scalable, decentralized face recognition systems with a focus on both privacy and long-term usability.

translated by 谷歌翻译

Fast Converging Anytime Model Counting

Yong Lai , Kuldeep S. Meel , Roland H. C. Yap

分类：人工智能

2022-12-19

Model counting is a fundamental problem which has been influential in many applications, from artificial intelligence to formal verification. Due to the intrinsic hardness of model counting, approximate techniques have been developed to solve real-world instances of model counting. This paper designs a new anytime approach called PartialKC for approximate model counting. The idea is a form of partial knowledge compilation to provide an unbiased estimate of the model count which can converge to the exact count. Our empirical analysis demonstrates that PartialKC achieves significant scalability and accuracy over prior state-of-the-art approximate counters, including satss and STS. Interestingly, the empirical results show that PartialKC reaches convergence for many instances and therefore provides exact model counting performance comparable to state-of-the-art exact counters.

translated by 谷歌翻译

An annotated instance segmentation XXL-CT dataset from a historic airplane

Roland Gruber , Nils Reims , Andreas Hempfer , Stefan Gerth , Michael Salamon , Thomas Wittenberg

分类：计算机视觉

2022-12-16

The Me 163 was a Second World War fighter airplane and a result of the German air force secret developments. One of these airplanes is currently owned and displayed in the historic aircraft exhibition of the Deutsches Museum in Munich, Germany. To gain insights with respect to its history, design and state of preservation, a complete CT scan was obtained using an industrial XXL-computer tomography scanner. Using the CT data from the Me 163, all its details can visually be examined at various levels, ranging from the complete hull down to single sprockets and rivets. However, while a trained human observer can identify and interpret the volumetric data with all its parts and connections, a virtual dissection of the airplane and all its different parts would be quite desirable. Nevertheless, this means, that an instance segmentation of all components and objects of interest into disjoint entities from the CT data is necessary. As of currently, no adequate computer-assisted tools for automated or semi-automated segmentation of such XXL-airplane data are available, in a first step, an interactive data annotation and object labeling process has been established. So far, seven 512 x 512 x 512 voxel sub-volumes from the Me 163 airplane have been annotated and labeled, whose results can potentially be used for various new applications in the field of digital heritage, non-destructive testing, or machine-learning. This work describes the data acquisition process of the airplane using an industrial XXL-CT scanner, outlines the interactive segmentation and labeling scheme to annotate sub-volumes of the airplane's CT data, describes and discusses various challenges with respect to interpreting and handling the annotated and labeled data.

translated by 谷歌翻译

Resilient Terrain Navigation with a 5 DOF Metal Detector Drone

Patrick Pfreundschuh , Rik Bahnemann , Tim Kazik , Thomas Mantel , Roland Siegwart , Olov Andersson

分类：机器人

2022-12-14

Micro aerial vehicles (MAVs) hold the potential for performing autonomous and contactless land surveys for the detection of landmines and explosive remnants of war (ERW). Metal detectors are the standard tool, but have to be operated close to and parallel to the terrain. As this requires advanced flight capabilities, they have not been successfully combined with MAVs before. To this end, we present a full system to autonomously survey challenging undulated terrain using a metal detector mounted on a 5 degrees of freedom (DOF) MAV. Based on an online estimate of the terrain, our receding-horizon planner efficiently covers the area, aligning the detector to the surface while considering the kinematic and visibility constraints of the platform. For resilient localization, we propose a factor-graph approach for online fusion of GNSS, IMU and LiDAR measurements. A simulated ablation study shows that the proposed planner reduces coverage duration and improves trajectory smoothness. Real-world flight experiments showcase autonomous mapping of buried metallic objects in undulated and obstructed terrain. The proposed localization approach is resilient to individual sensor degeneracy.

translated by 谷歌翻译

A Multi-Segment, Soft Growing Robot with Selective Steering

Alexander M. Kübler , Sebastián Urdaneta Rivera , Frances B. Raphael , Julian Förster , Roland Siegwart , Allison M. Okamura

分类：机器人

2022-12-07

Everting, soft growing vine robots benefit from reduced friction with their environment, which allows them to navigate challenging terrain. Vine robots can use air pouches attached to their sides for lateral steering. However, when all pouches are serially connected, the whole robot can only perform one constant curvature in free space. It must contact the environment to navigate through obstacles along paths with multiple turns. This work presents a multi-segment vine robot that can navigate complex paths without interacting with its environment. This is achieved by a new steering method that selectively actuates each single pouch at the tip, providing high degrees of freedom with few control inputs. A small magnetic valve connects each pouch to a pressure supply line. A motorized tip mount uses an interlocking mechanism and motorized rollers on the outer material of the vine robot. As each valve passes through the tip mount, a permanent magnet inside the tip mount opens the valve so the corresponding pouch is connected to the pressure supply line at the same moment. Novel cylindrical pneumatic artificial muscles (cPAMs) are integrated into the vine robot and inflate to a cylindrical shape for improved bending characteristics compared to other state-of-the art vine robots. The motorized tip mount controls a continuous eversion speed and enables controlled retraction. A final prototype was able to repeatably grow into different shapes and hold these shapes. We predict the path using a model that assumes a piecewise constant curvature along the outside of the multi-segment vine robot. The proposed multi-segment steering method can be extended to other soft continuum robot designs.

translated by 谷歌翻译

maplab 2.0 -- A Modular and Multi-Modal Mapping Framework

Andrei Cramariuc , Lukas Bernreiter , Florian Tschopp , Marius Fehr , Victor Reijgwart , Juan Nieto , Roland Siegwart , Cesar Cadena

分类：机器人

2022-12-01

Integration of multiple sensor modalities and deep learning into Simultaneous Localization And Mapping (SLAM) systems are areas of significant interest in current research. Multi-modality is a stepping stone towards achieving robustness in challenging environments and interoperability of heterogeneous multi-robot systems with varying sensor setups. With maplab 2.0, we provide a versatile open-source platform that facilitates developing, testing, and integrating new modules and features into a fully-fledged SLAM system. Through extensive experiments, we show that maplab 2.0's accuracy is comparable to the state-of-the-art on the HILTI 2021 benchmark. Additionally, we showcase the flexibility of our system with three use cases: i) large-scale (approx. 10 km) multi-robot multi-session (23 missions) mapping, ii) integration of non-visual landmarks, and iii) incorporating a semantic object-based loop closure module into the mapping framework. The code is available open-source at https://github.com/ethz-asl/maplab.

translated by 谷歌翻译

Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction

Kaiyuan Yang , Houjing Huang , Olafs Vandans , Adithya Murali , Fujia Tian , Roland H. C. Yap , Liang Dai

分类：机器学习

2022-11-27

A central problem in computational biophysics is protein structure prediction, i.e., finding the optimal folding of a given amino acid sequence. This problem has been studied in a classical abstract model, the HP model, where the protein is modeled as a sequence of H (hydrophobic) and P (polar) amino acids on a lattice. The objective is to find conformations maximizing H-H contacts. It is known that even in this reduced setting, the problem is intractable (NP-hard). In this work, we apply deep reinforcement learning (DRL) to the two-dimensional HP model. We can obtain the conformations of best known energies for benchmark HP sequences with lengths from 20 to 50. Our DRL is based on a deep Q-network (DQN). We find that a DQN based on long short-term memory (LSTM) architecture greatly enhances the RL learning ability and significantly improves the search process. DRL can sample the state space efficiently, without the need of manual heuristics. Experimentally we show that it can find multiple distinct best-known solutions per trial. This study demonstrates the effectiveness of deep reinforcement learning in the HP model for protein folding.

translated by 谷歌翻译

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

Giulia Vezzani , Dhruva Tirumala , Markus Wulfmeier , Dushyant Rao , Abbas Abdolmaleki , Ben Moran , Tuomas Haarnoja , Jan Humplik , Roland Hafner , Michael Neunert

分类：机器学习 | 人工智能 | 机器人

2022-11-24

The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents. Skill reuse is one of the most common approaches, but current methods have considerable limitations.For example, fine-tuning an existing policy frequently fails, as the policy can degrade rapidly early in training. In a similar vein, distillation of expert behavior can lead to poor results when given sub-optimal experts. We compare several common approaches for skill transfer on multiple domains including changes in task and system dynamics. We identify how existing methods can fail and introduce an alternative approach to mitigate these problems. Our approach learns to sequence existing temporally-extended skills for exploration but learns the final policy directly from the raw experience. This conceptual split enables rapid adaptation and thus efficient data collection but without constraining the final solution.It significantly outperforms many classical methods across a suite of evaluation tasks and we use a broad set of ablations to highlight the importance of differentc omponents of our method.

translated by 谷歌翻译